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Design Considerations for Very-High-Speed 
Si-Bipolar IC's Operating up to 50 Gb/s 



H.-M. Rein, Member, IEEE, and M. Moller 



Abstract — In this paper, design aspects of high-speed digital 
and analog IC's are discussed which allow the designer to exhaust 
the high-speed potential of advanced Si-bipolar technologies. 
Starting from the most promising circuit concepts and an ade- 
quate resistance level, the dimensions of the individual transistors 
in the IC's must be optimized very carefully using advanced 
transistor models. It is shown how the bond inductances can 
be favorably used to improve circuit performance and how the 
critical on-chip wiring must be taken into account. Moreover, 
special modeling aspects and ringing problems, caused by emitter 
followers, are discussed. An inexpensive mounting technique is 
presented which proved to be well suited up to 50 Gb/s, the high- 
est data rate ever achieved in any IC technology. The suitability 
of the design aspects discussed is confirmed by measurements of 
digital circuits and broadband amplifiers developed for 10 and 
40 Gb/s optical-fiber links. 



I. Introduction 

SILICON bipolar circuits are just penetrating speed ranges 
which some years ago still seemed to be reserved for com- 
pound semiconductors (e.g., GaAs). For example, today there 
is no question that all circuits in 10 Gb/s optical-fiber links can 
be fabricated in available advanced Si production technologies 
(e.g., [I] and [2]). The same is true for most circuits in wireless 
applications with typical frequencies between 0.9 and 2.5 
GHz or even 5.8 GHz. Recently, silicon MUX and DEMUX 
IC's were realized in a laboratory technology operating above 
40 Gb/s without needing sophisticated processes [3], [4]. Si- 
based IC's for even higher operating speeds are just under 
development. 

An important precondition for this speed improvement is 
the progress in silicon bipolar technologies. However, digital 
circuits with clock frequencies as high as the transistor transit 
frequency f T or broadband amplifiers with cutoff frequencies 
up to about /r/2 are only possible if adequate circuit concepts 
are used and if the circuits are carefully designed. Such a 
design does not only include the optimization of the circuit 
resistances (and thus of the transistor operating points and 
voltage swings) but also the individual design of all transistors 
in the circuit as well as the optimization of on-chip wiring and 
even of the length of the bond wires. All these aspects are 
discussed in this paper. The basic steps in high-speed circuit 
design are summarized in Fig. 1, starting with the choice of 

Manuscript received October 1, 1995; revised February 23, 1996. This work 
was supported in part by the European Union (R&D Programs RACE and 
ESPRIT) as well as by the German Ministry BMBF (Project Photonik II). 

The authors are with Ruhr-University Bochum. D-44780 Bochum, Ger- 
many. 

Publisher Item Identifier S 0018-9200(96)05707-1. 



1 Target Specified 



Transistor 
Models 



Circuit Principles 
Circuit Diagram 



-« Technology j 



Intrinsic Circuit Design 
especially optimization of 
resistances and transistors 



Line Models 
(on chip) 



Layout (optim. placement) 

metaJJiz. parasitica, crosstalk 



Models for 
Mounting 
Parasitics 
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Fig. 1 . Basic steps in the design of high-speed circuits. For simplification 
the iteration loops between the different steps are not shown. 



optimum circuit principles and finishing by simulating the 
complete IC with all parasitics. 1 

The circuit examples discussed in this paper are digital 
IC's and broadband amplifiers, e.g., used in optical-fiber links. 
For this application, maximum speed is often a must, while 
the power consumption is not a limiting condition. However, 
most of the circuits are of interest for other applications, 
too. Moreover, the design aspects presented here are also 
applicable to low-power high-speed circuits (as shown by a 
single example) and most of them also to other kinds of circuits 
(e.g., for wireless applications). 

Numerous analog and digital high-speed IC's have been 
designed by the authors' group (RUB) considering the aspects 
discussed in this paper. In Table I, some examples developed 
in cooperation with our industrial partners (given in paren- 
theses) and fabricated in production technologies of different 
companies are summarized, together with the corresponding 
references. In the last column, the most recent results based on 
a laboratory technology are given. All these circuits achieve 
record values for silicon and some of them are even faster 
than the fastest compound semiconductor IC's. Some of the 
most critical IC's (with respect to achieving the demanded 
specifications) are shortly discussed at the end of this paper. 

II. Si-BrpoLAR Technologies 

Today, the most promising production technologies are 
based on self-aligning double-polysilicon processes. The 

1 In contrast to the simplified scheme in Fig. 1 , design experts often use 
preliminary values for metallization and mounting parasitics already in the 
first optimization step of the circuit in order to reduce the number of iterations. 
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TABLE I 

High-speed IC's for Optical Communications in Si-BiPOLAR Production Technologies— State of the 
Art. Record Speeds in a Laboratory Technology Are Given in the Last Column (Siemens -f RUB) 



Circuit 


Max SpMd / Prod. Technology 


Remarks 


Labor. T«chn 


Multiplexer 


30GbJs(RUB + HP)/HP25 [5] 


2:1 


50 Gb/i [4] 


Decision Circuit 


22 Gb/s (RUB + Siemens) / B6HF [6] 


CPM«180* 


25Gb/> [12] 


Demultiplexer 


32 Gb/s (RUB + Siemens) / B6HF [6] 


1:2 


46 Gb/s pj 


Static Frequency 
Divider 


19 GHz (RUB + Siemens) /B6HF [7] 


2:1, 2 Outputs (0* and 90*) 


30 GHz (3] 


Laser Driver 


14 Gb/i (RUB + Siemens) / B6HF [8] 


Swing AV Q - 3.6 V (50 0) 


20 Gb/s [22] 


Preamplifier 
(Trans imp. Type) 


13 Ob/s (RUB ♦ ANT + Siemens) f91 
/B6HF 1 1 


Transimp. Zti 3 615 Q 
f c = 10 GHz 

Noise j N = 10J pA/VHl 


20 Gb/s [22] 


Main Amplifier, 
AOC 

Main Amplifier, 


13 GWs (RUB ♦ ANT ♦ Motorola) 

/Advanced MOSAIC V l lw i 

15 Gb/i (RUB + ANT + Siemens) .... 

/B6HF lUJ 


S21 » 37 dB, f c - 10 GHz 
Dynamic Range = 41 dB 
S71 ° 52 dB (max.) 
Dynamic Range ■ 46 dB 





main advantage of double-polysilicon compared to single- 
polysilicon technologies is the lower base resistance. A typical 
example of an advanced production technology is the B6HF 
of Siemens [13]. It is characterized by an fa of about 25 
GHz, low base resistance, 0.8 ^ra lithography (resulting in 
0.4 pxn emitter stripe width), three metallization levels, and 
a rather conservative isolation technique. Most of the circuits 
presented in Table I are realized in this production technology 
or in an improved but unsophisticated laboratory version [14]. 
Even higher operating speeds can be achieved by use of SiGe- 
base technologies (e.g., [2]), as, e.g., shown by our current IC 
designs for a 40 Gb/s optical system [22]. 

For choosing the right technology for high-speed ICs, it is 
highly recommended not only to take a look at maximum f T . 
A slightly better "figure of merit" is a simplified relation for 
the maximum oscillation frequency / max which, in addition, 
contains the base resistance tb and the total collector-base ca- 
pacitance Ccb (consisting of junction and oxide capacitance) 

/max = 

Instead of using a single but inadequate figure of merit, the 
experienced designer prefers to take a look at the individual 
(area- and length-) specific electrical parameters in order to 
judge a technology. Especially the base resistance and emitter 
contact resistance, as well as the various junction and oxide 
capacitances, are considered (all related to the emitter size) 
and, of course, the total transit time t/. 2 

One of the weak points of / max as a figure of merit is 
that the current carrying capability, given by the maximum 
admissible collector current density jcK (see Section IV-B), 
is not considered. As a consequence, record values for 

/max* 

achieved by weakly doped and thick epitaxial collector regions 
[16], are dearly paid by a low jcK which is not adequate 
for most applications. Note that j'ck is a very important 
transistor parameter, since the high-speed driving capability of 
the transistor is strongly influenced by Jck • A E /Cc3 We = 
emitter area). In practice. j*ck is increased by a selective 

2 From these parameters, the designer can derive other figures of merit 
which are tailored to a specific class of circuits. 



implantation in the collector (SIC) region below the emitter 
which additionally can increase fa also at medium current 
densities. 

III. High-Speed Circuit Principles 

A. Digital Circuits 

Fig. 2 shows the circuit diagram of a master-slave D- 
flipflop (MS-D-FF) as an example for a frequently used 
circuit. It consists of the circuit core with data input stages, 
a clock buffer, and an output buffer for driving matched 50- 
n transmission lines. This circuit equals the decision circuit 
in Table I and is also the basic component of the frequency 
divider and the demultiplexer given there. From this diagram, 
most of the circuit principles recommended for high-speed 
IC's can be taken which shall be summarized briefly. 

• E?CL instead of ECU This means that the emitter fol- 
lowers (EF) are used at the data inputs and the limiting 
differential stages, called current switches (CS), at the 
output. Among other advantages, this circuit concept 
shows essentially improved behavior in a transmission 
line environment compared to ECL (e.g., better line 
matching and steeper pulse edges). 

• Series Gating. Besides high-speed logic connection (e.g., 
between data and clock signals) this concept allows 
differential operation. 

• Differential Operation. This principle results in many 
advantages, which are summarized in Fig. 3. 

• Low Voltage Swing. This measure increases speed (es- 
pecially of the CS) and reduces power consumption. 
Optimum values for internal differential swings of the 
data pulses are usually about 2 x 200 mVp_ p = 400 
mVp_ p . 

• Multiple Emitter Followers. At high operating speed, 
the decoupling capability (impedance transformation) of 
emitter followers is rather limited due to the reduced 
effective current gain of the transistors. Thus two or 
even three cascaded emitter followers are often required. 
Moreover, by this measure Vce across the CS transistors 
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data input 
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rtrr 
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Fig. 2. Circuit diagram of a high-speed master-slave D-flipflop (MS-D-FF). 
(a) MS-D-FF core with data input stage, (b) Clock buffer, (c) Data output 
buffer. 



complementary signals lower voltage swing 

with symmetrical transients across the load resistors 



no reference voltage 
for the CSs required 








simpler and more simpler mounting higher data rates at reduced power 

reliable design technique improved eye diagrams consumption 



Fig. 3. Advantages of differential operation (compared to single-ended op- 
eration). 

and thus the maximum collector current density j C K are 
increased. As a consequence the transistor size can be 
reduced, further reducing Ccd- 
• On Chip Matching of external transmission lines, espe- 
cially at the data input. Compared to off-chip matching, 
the return loss is essentially improved and the poten- 
tial instability of emitter-follower inputs can be reliably 
suppressed [17], [18]. 



B. Broadband Amplifiers 

Amplifiers in broadband communications must be operated 
down to very low frequencies (e.g., down to about 30 kHz in 
optical-fiber links). Thus, in monolithic integrated amplifiers 
dc coupling is necessary. Moreover, it is recommended to 
apply the principle of strong impedance mismatching between 
succeeding stages. As a typical example, a chain of alternating 
transadmittance stage (TAS), transimpedance stage (TIS), and 



EFs 



TAS 



TIS 



EFs 



TAS 




Double Stage by Cherry and Hooper 

Fig. 4. Principle of strong impedance mismatching in broadband amplifiers. 
Z stands for the input and output impedances, respectively. 



emitter followers (EFs) is shown in Fig. 4. 3 Due to this 
mismatching, the transfer functions of the different stages (e.g., 
transadmittance and transimpedance) remain approximately 
constant up to comparatively high frequencies, resulting in 
a high cutoff frequency. This is because of the drastically 
reduced influence of the strongly frequency-dependent input 
and output impedances of the different stages. As another 
advantage of mismatching, all nodes in the circuit are low- 
ohmic. This fact reduces the influence of parasitic capacitances 
and thus further increases the bandwidth. In Fig. 4 the original 
double-stage proposed by Cherry and Hooper [19], consisting 
of a TAS and a TIS, is extended by (one to three) EF's between 
the output of the TIS and the input of the TAS. These EF's 
are required for improving the insufficient mismatching at this 
interface, for level shifting, and for gain peaking near the upper 
frequency limit. At high frequencies, where mismatching is 
degraded, conjugate complex input and output impedances can 
be observed at the interfaces between different stages, which 
can further increase the bandwidth. 

In the receivers of optical links (without an optical-fiber 
amplifier in front of the photodiode) we need a low-noise 
preamplifier behind the photodiode, succeeded by a high-gain 
main amplifier. Both circuits must be designed for a wide 
dynamic range. Usually, they are realized on separate chips 
connected by transmission lines [20]. In the preamplifier, 
the first stage of the chain of mismatched stages is generally a 
TIS in order to guarantee broadband mismatching between the 
driving (high-impedance) photodiode and the amplifier input. 
This stage is followed by emitter followers and an output 
buffer (TAS) with external 50-Q loading [9], [21]. Using EF's 
at the output is not recommended (as in the case of digital 
circuits). 

The main amplifier must have a high gain at 50-H input 
impedance. Therefore, often on-chip matching and a single EF 
pair are used at the input, followed by several basic amplifier 
cells (cf. Fig. 4) and a TAS output buffer [10], [11]. Since 
the main amplifier must have a constant output amplitude, 

3 Sometimes the cutoff frequency of the gain magnitude is increased by 
using an EF in the feedback path of the TIS [23]. However, by this measure the 
group delay is often degraded, even resulting in a reduction of the maximum 
usable operating speed (cf. Section 1V-B). 



REIN AND MOLLER: DESIGN CONSIDERATIONS FOR VERY-HIGH-SPEED Si-BIPOLAR ICS OPERATING UP TO 50 Gh/s 



1079 



EFs 



extension for \ 
gain control 



gain 
control 



input 




output 



6 V 0 (<0) 



Fig. 5. Differential broadband amplifier cell. The shaded circuit part is 
required for AGC amplifiers only. The realization of the adjustable peaking 
capacitance Ce is, e.g., given in [40]. 
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independent of the input signal amplitude, two solutions are 
possible: limiting amplifier or automatic-gain-control (AGC) 
amptifier. If possible, differential operation should be preferred 
(cf. Section II1-A, Fig. 3). A differential version of the basic 
amplifier cell in Fig. 4 is shown in Fig. 5. Without the shaded 
circuit part, it has a constant gain at small input amplitudes, 
but it operates also excellently in the limiting range and is, 
therefore, well suited for limiting amplifiers (but often with 
Re = 0) [11]. For AGC amplifiers the gain of the amplifier 
cell must be controllable. To our experience, the best results 
for such amplifiers with respect to flat gain response and high 
cutoff frequency over a wide dynamic range are obtained if the 
TAS is extended (by the shaded circuit block in Fig. 5) to a 
four-quadrant multiplier. In this example, the gain is controlled 
by a dc voltage at the upper input. 

Broadband amplifiers operating near the limit of a given 
technology are more sensitive to fabrication spread and design 
uncertainties (e.g., errors in modeling of the transistors as 
well as of the metallization and mounting parasitics) com- 
pared to digital circuits. Moreover, they are more sensitive to 
changes in the circuit environment (e.g., to the driving source 
impedance). Therefore, it is highly recommended to look for 
measures which enable us to correct the frequency response 
after fabrication or, if possible, even after chip mounting. 
This especially holds if the circuits are designed in an early 
stage of technology and system development, as for most 
of our circuits. 4 One example is to adjust the gain peaking 
capacitance C E in Fig. 5. This is possible by cutting links 
of a multicapacitor network in the upper metallization level 
(by a laser or ultrasonic cutter). Another very simple way, but 
with a limited capacitance range, is to realize Ce by junction 
capacitances which can be varied by adjusting the dc bias 
voltage via external potentiometers [40]. We often combine 
both measures. In addition to Ce we sometimes use a narrow- 
band peaking network at the emitter node of the TAS [9]. It 

4 In a later stage, if the data of system and technology are well established, 
this option of frequency-response correction may no longer be necessary. 



transimp. stage 
S_ (towZ,) 




H 



e o to 12 

f [GHz] -~ 



(C) 



Fig. 6. Examples for favorable use of bond inductances, (a) Transmission 
line matching at IC input (b) Steepening of pulse edges at IC output, (c) 
Improvement of signal-to-noisc ratio (preamplifier). 

consists of a junction capacitance in series to the (inductive) 
output impedance of cascaded emitter follower stages. This 
circuit is well suited to adjust the magnitude of the gain and 
the group delay near the upper frequency limit of the amplifier. 
Moreover, for fine-adjustment of the frequency response, it is 
sometimes useful to vary (via an external potentiometer) the 
current through that EF which drives the TAS. 

C. Utilization of Bond Inductances for 
Improving Circuit Performance 

A certain disadvantage of present Si technologies compared 
to compound semiconductor technologies is that adequate on- 
chip inductors at frequencies above 5 GHz are usually not 
available. Thus we have to think about improvement of circuit 
performance by applying bond wires as inductors [1]. Some 
examples for the favorable use of bond inductances in high- 
speed digital and broadband amplifier ICs are given in Fig. 6. 

Good matching of the driving transmission lines, espe- 
cially at the data inputs, is a must in many high-speed ICs. 
Otherwise, double-reflections can increase the time jitter con- 
siderably [17], [18]. Fig. 6(a) demonstrates how this matching 
is improved by the bond inductance Lb- In this example 
the circuit input, which is roughly approximated by an RC 
network, is driven by a microstrip line with the characteristic 
impedance Z 0 = 50 H. Rj w Z 0 is the on-chip matching 
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resistance (cf. Section III-A) and Cj represents the total input 
capacitance of the IC, consisting of bond pad capacitance and 
averaged input capacitance of the first (loaded) EF. Mostly we 
use the min imum possib le bond inductance L B min- Even then, 
typically y/ L B min/ 'Ci > Z 0 holds. Then a capacitance AC 
is added to the end of the line 5 so the condition 

is approximately met. The LC network built by AC, L B , and 
Ci acts roughly as a lengthening of the line up to the on- 
chip matching resistor. Of course there is an upper frequency 
limit of this matching method due to the cutoff frequency 
of the (lumped) LC network and, moreover, due to the fact 
that the input of the circuit can no longer be modeled by a 
constant capacitance at high frequencies. It should be noted 
that the situation is simplistically presented in Fig. 6(a) (cf. 
[18]). For example, at single-ended operation, the ground 
bond inductance [not shown in Fig. 6(a)] influences the input 
impedance of the circuit and must also be taken into account 
[ 10]. As a practical example, an input return loss of S n < -20 
dB up to 11 GHz was measured for a limiting amplifier 
fabricated in a production technology [11]. 

The right-hand side of Fig. 6(a) shows a simulated example 
based on an advanced laboratory technology. Here, the return 
loss at the data input of a high-speed digital IC [cf. Fig. 2(a)] 
is plotted versus frequency. In this case, C/ ss 60 fF and 
L B = 0.3 nH. From (2) we get AC as 60 fF. The result 
shows that there is a good chance to obtain a fairly good line 
matching up to 30 GHz, which is sufficient for data rates up 
to about 40 Gb/s. As shown by the dashed line, S u is much 
worse without the Lb - AC network. 

Fig. 6(b) demonstrates steepening of pulse edges by a 
peaking bond inductance. Here, the output stage of a digital 
IC is approximated by a switched current source (J 0 ) with 
the output capacitance Cq and the on-chip output resistance 
Rq. In this case, Rq is connected to ground via a bond 
wire with inductance L B . As a consequence, fall and rise 
times are reduced compared to the value determined by the 
output time constant tq, which is given by the effective 
load resistance (Rq//Z 0 ) and the output capacitance Cq. 
This is shown by comparison of the solid pulse shape for 
optimum L B {- L Bopt ) with the dashed shape for L B = 0. 
This measure is of special importance for driver circuits with 
high output voltage swing AVq, since there tq is roughly 
proportional to AVq. For example, it was required to meet 
the demanded specifications of the laser/modulator driver in 
Table I (cf. Section VII-A and [8]). 

In Fig. 6(c) it is shown how the signal-to-noise ratio (de- 
termined by the reciprocal value of the equivalent input 
noise current density j N ) can be increased in the case of a 
transimpedance preamplifier by use of an (overlength) bond 
wire [9], [21], [24]. For this, the input signal amplitude is 
peaked near the cutoff frequency f c of the amplifier by the 
parallel resonance circuit consisting of L B and the capacitance 
Cp D of the photodiode (PD). Note that damping of this 

5 AC can be realized in a simple manner by broadening the microstrip line 
ai its end. It also includes the stray capacitance at the end of the line. 



resonance circuit is limited due to the comparatively low 
values of the input impedance Zj of the amplifier and of the 
series resistance rpo of the PD. A very rough approximation 
for the optimum value of L B is given by 

^Bopt«[(27r/c) 2 C PD ]- 1 . (3) 

Moreover, it is shown in Fig. 6(c) how L B reduces jn at 
high frequencies in a typical preamplifier for a 10 Gb/s optical 
system [21]. Note that here j N is related to the internal node 
of the photodiode. 

IV. Intrinsic Circuit Design 

To limit the length of this paper, most of the examples given 
in this section are restricted to digital circuits. However, some 
of the basic ideas are also valid for analog circuits. 

A. Resistance Level and Power Consumption 

Usually the power consumption P of very high speed IC's 
with comparatively small-scale integration (SSI) primarily 
depends on whether or not the (high-speed) outputs of the 
circuits are heavily loaded and only in the second place by the 
high operating speed. The situation is qualitatively expressed 
by Fig. 7, which shows the basic relation between operating 
speed and power consumption for strong and weak loading, 
respectively. Strong loading, e.g., by 50-Q transmission lines, 6 
increases the power consumption drastically not only by the 
need of powerful output buffers but also by the increased 
power consumption of the circuit core and input stages (cf. 
Fig, 2). The main reason for this fact is the severely reduced 
decoupling capability of the EF's at high speed. 7 In order to 
improve decoupling between external load and internal circuit 
nodes, power-consuming cascaded EF's and sometimes (at the 
speed limit) even two current-switch stages are required for 
the output buffer [cf. Fig. 2(c)]. Even then the loading of the 
internal circuit nodes is still comparatively high. Therefore, 
these nodes must be low-ohmic resulting in an increased 
power consumption of the circuit core and the input stages, 
too. Again, the power consumption of these circuits is further 
increased due to the reduced decoupling capability of the EF's 
[cf. Fig. 2(a) and (b)]. 

In contrast, if the output of a high-speed circuit is only 
weakly loaded (e.g., by similar circuits on the chip rather than 
by 50-17 buffers) the power consumption can be drastically 
reduced (Fig. 7). Of course, if the circuit has to be operated 
near the speed limit of the technology, the last, e.g., 20% 
in speed has to be paid by an overproportional increase 
of power consumption. In order to confirm this statement 
experimentally, first a 2 : 1 static frequency divider was de- 
veloped on the base of a MS-D-FF and fabricated with a 
production technology [7]. This circuit was designed with the 
aim to achieve maximum speed. Despite the fact that both 
outputs of the internal MS-D-FF (latches) are heavily loaded 
by 50-ft output buffers a maximum input frequency of 19 
GHz was achieved (cf. Table I), however, at cost of a high 

6 This is the typical situation for high-speed SSI IC's in optical-fiber links. 
For these IC's power consumption is normally no limiting condition. 

7 This is because of the drastic reduction of current gain (|/?| w fr/f) at 
such high frequencies (e.g.. up to / = /r/2 or even above). 
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Fig. 7. Basic relation between operating speed and power consumption for 
different loading conditions. The power consumption of the output buffers in 
the case of strong loading, which would even increase the difference between 
both curves, is not considered. (For numerical examples and the definition of" 
P 0 cf. text.) 

power consumption of 160 mW (power consumption of the 
buffers not included). Then, in order to cancel the influence of 
peripheral loading, a 16: 1 divider was developed, whose first 
(speed limiting) stage is only weakly loaded by the second 
one [7]. Now it is possible to reduce the power consumption 
to only 2.5% (4 mW) of the first example, still achieving a 
maximum frequency of 10.5 GHz, i.e., of 55%. For this, we 
started from the high-speed divider keeping the voltage swings 
constant but applying the following measures (cf. [25]). 

• First, the emitter lengths l Eu (v stands for the different 
transistors) were reduced according Iev oc P, down to a 
power consumption at which the minimum emitter length 
given by the technology (Ie min) was achieved for the 
upper current switch transistors [cf. Fig. 2(a)]. Note that 
at not too small 1& V the transistor junction (+ oxide) 
capacitances C v are roughly proportional to l Ev and 
thus to P, thus keeping the corresponding time constants 
R V C V nearly constant (circuit resistances R v oc 1/P). 8 
This also holds for the ratio r B „/Rw 

• Next, some of the transistor configurations were changed 
from two base contacts to one, further reducing the collec- 
tor and substrate capacitance (but, of course, increasing 
r B ). 

• Then, the circuit diagram was simplified, especially the 
number of (power consuming) emitter followers was 
reduced. This allows to reduce the supply voltage (here 
from 5 to 3.5 V). 

The final power consumption of 4 mW is marked in Fig. 7 
by Pq. Only for P < P 0 where the transistor size can no 
longer be adjusted to the power consumption, the curve is 
going to approximate the well-known proportionality between 
speed and power (dashed line in Fig. 7). This onset of strong 
speed reduction with decreasing power can be shifted to lower 
values of P if the implantation dose of the SIC (Section II) is 
reduced, thus reducing Cqq [25]. 

Based on the loading considerations discussed in this section 
and starting with a typical (low) internal voltage swing (cf. 
Section IE- A), we get a feeling for the resistance level and 
transistor operating points. Finding the optimum collector 

8 However, it should be mentioned that the on-chip wiring and bondpad 
capacitances cannot be reduced correspondingly. 



currents, especially for the different emitter followers, requires 
some experience and careful simulations. 

In the case of analog circuits, e.g., broadband amplifiers, 
additional aspects for resistance optimization have to be con- 
sidered. For example, in the TIS of a preamplifier the feedback 
resistance R F is a compromise between low noise (— » high 
R F ) and high bandwidth (— ► reduced R F ) [21]. There is also 
an optimum for the collector current of the TIS transistor with 
respect to minimum broadband noise and maximum cutoff 
frequency, respectively. In the main amplifier cell of Fig. 5 
the optimum choice of the emitter series resistance Re is a 
compromise between wide dynamic range and high bandwidth 
on one side (-+ high Re) and low noise (first stage) as well 
as high gain on the other side (— > low Re)- The optimum 
choice of Re in this cell is mainly a tradeoff between high 
gain (-► high R F ) and high bandwidth (-> lower R F ) [10]. 
But it should be mentioned that at given total gain of a 
multistage amplifier, increasing the bandwidth of the single 
cells by reducing R F requires more cells and thus may again 
reduce the bandwidth of the complete amplifier. 

B. Optimization of the Transistors in High-Speed Circuits 

There is a great degree of freedom for the designer of 
monolithic integrated circuits in adjusting the transistor di- 
mensions (at given operating points) with respect to optimum 
circuit performance. Regrettably, this opportunity is often not 
exhausted by the designer. 

Before starting the transistor optimization, the minimum 
admissible emitter area must be calculated in dependence on 
the operating point (Jc, V C e)- It is determined by the critical 
collector current density j C K (cf. Section II) above which a 
strong increase of the transit time t/ (decrease of fx) can 
be observed due to high-current effects ("Kirk-effect" [26]). 
Therefore, this region is normally not allowed for high-speed 
transistor design (for exceptions see e.g., [8]). The value of 
jcK can be taken from measurements, from analytical relations 
(e.g., [27], [28]), 9 or from a mixture of both. It increases 
with increasing internal collector-emitter voltage VcEi, 10 as 
well as increasing doping concentration Nc and decreasing 
thickness wc of the collector region below the emitter, and it 
decreases with increasing junction temperature $j (e.g., [27], 
[28]). An experimental example for the dependence of t/ on 
the averaged collector current density jc = Ic/Ae (A e = 
emitter area) is given in Fig. 8 with VcEi as a parameter. 
At small emitter width b E the increase of r/ with jc is 
mitigated due to current spreading (e.g., [28] and [29]). As a 
consequence, the designer can use somewhat higher values for 
jCK compared to the wider emitter stripes. A corresponding 
correction factor is proposed in [29], 

From the worst-case (w.c.) values of Vcei and $j and 
considering the fabrication spread (especially of Nc and wc) 
we get the w.c. value for jcK- From this, with the w.c. value 

9 Per definition, jt'ck in [27] is slighdy higher compared to the value in the 
present paper which is given by the onset of the high-current region. 

10 Compared to the terminal voltage Vce. *e internal voltage V C Ei is 
reduced by the voltage drop across both the external collector resistance and 
emitter resistance. 
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Fig. 8. Transit time versus collector current density with the internal collec- 
tor-emitter voltage VcEi as a parameter. 

of ic, the minimum admissible emitter area can be calculated 

^Emin = /c/7CK, with j C K = f(VcEh$j, N C: W C )w.C. 

(4) 

For a given emitter stripe width b E the condition for the total 
emitter length 11 is l B > A Emin /b E . 

In most cases the optimum b E equals the minimum value 
Deo given by the technology because of the demand on a low 
base resistance (exceptions see below). Thus, starting from 
Ie == A Em i n /b E o the emitter length can be increased in order 
to find the optimal transistor size for circuit performance. This 
optimum is a compromise between the contradicting demands 
on low base and emitter resistance r B and r E (i.e., large 
l E ) on one side and low junction (+ oxide) capacitances 
Ceb,(7cb> and Ccs (i.e.. small l E ) on the other side. Usually, 
in high-speed circuits the dominating transistor parameters, 
besides r/, are r B and C C b. 12 Their dependence on l E (at 
t>E = b E o = const.) can be roughly expressed by 

r B oc l/l E and C C b oc l E (5) 

provided l E is not too small. As a rough rule of thumb, in 
digital circuits with a configuration similar to that in Fig. 2, 
the transistors in upper current switches (CS) are usually 
optimal for minimum emitter lengths, i.e., l E = A Emia /b E0 
(provided that the length-specific base resistance is not too 
high). This is mainly due to the strong influence of Ccb caused 
by the Miller effect and the dotting of two collectors which 
doubles the capacitive loading by C C b and Ccs- Minimum 
length is often also optimal for the first one of cascaded 
EF's since this transistor is already sufficiently long due to 
its comparatively low j C K (low VceO and because its output 
is rather weakly loaded. Moreover, a low Ccb is desired 
since, it either increases the loading of the upper internal CS 
or degrades transmission line matching at the circuit input 
(cf. Section III-C). In contrast, the second EF in double-EF 
stages is usually longer than its minimum value because it is 

11 In general, l E is the number of emitter fingers times the single emitter 
stripe length. 

12 However, in future IC's the influence of r E may strongly increase due to 
the ongoing reduction of b E (e.g., [30]) combined with an increase of ;'ck • 
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(strongly) loaded by a CS and needs, therefore, a low r B (low 
output impedance) rather than a low Ccb (no Miller effect). 
However, it should be noted that these high-speed design 
considerations for EF's may fail if ringing on top of the pulses 
occurs which, e.g., increases the time jitter. This potential 
problem of EF stages often requires a tradeoff between speed 
and stability in the design of EF transistors (cf. Section IV-C). 

In contrast to the design b E = b E o discussed up to now, the 
choice b E > b E o at A E = A Emin may be advantageous for 
transistors whose base resistance is of minor importance while 
Ccb and Ccs have a strong influence on circuit performance. 
Here, a reduction of l E (i.e., l E < A Emin /b E0 ) and thus 
of Ccb and Ccs may be advantageous. However, it should 
be mentioned that if b E exceeds b E o, t b increases stronger 
with reduced l E (= A Emin /b E ) than given by (5), as for the 
internal base resistance now.r B i oc b E /l E — A Emm /l% == 
b%/A Emm holds. A typical example for the design b E > 
b E Q are transistors in output buffers with comparatively high 
voltage swing AVq, where the output capacitance (mainly 
determined by C C b and Ccs) is of dominating influence. For 
example, for the laser/modulator driver in Table I, b E = 2b E o 
was chosen [8]. There, due to the high AVq, the collector 
current I c and thus A Em - m are high, resulting in a still 
sufficiendy low r B i. Another example where the design b E > 
b E0 may be advantageous are transistors in current sources. 
Again, low values of C CB and C C s are of main interest, while 
r B is normally of smaller influence. 

From these considerations, an optimization procedure for 
the transistors in a digital circuit can be derived (e.g., with 
the aim of a clear eye diagram at maximum possible data 
rate). The transistor dimensioning starts from a given circuit 
diagram (cf. Section IH-A) and, additionally, it is assumed 
that a first set of the resistances in the circuit has already 
been chosen (cf. Section IV-A) so that the operating points 
of the transistors are given. Then the following (simplified) 
procedure is recommended. 

• Calculate the w.c. values of jcKu (minimum) and Iq v 
(maximum) for each transistor (*V stands for the differ- 
ent transistors). 

• Calculate the minimum emitter areas A Ev ^m = 

• Check whether Ic is below the maximum admissible 
current I M for the metal stripe of a single emitter finger. 
Otherwise, increase the number of emitter fingers (n E > 
IcIImY 

• Calculate the parameter sets of all transistors for A Eu = 
A Et ,, m in and minimum emitter stripe width 6^0 (i.e., 
l Ev = A El , tmtn /b E o). Check the eye diagrams and deter- 
mine the maximum data rate. This starting dimensioning 
normally proves to be already the optimum for transistors 
in the upper CS's and often also in the lower CS's and 
the first EF stages. 

• Lengthen the emitter stripes of the switching transis- 
tors (i.e., l Ev > A El/ , min /b E o) and check whether the 
data rate can be increased and the eye diagrams can 

13 For simplification, it is not considered here that, due to emitter-size 
tolerances, the nominal A EUtmin is slightly larger in order to meet (4) under 
w.c. conditions. 
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be improved, respectively. An improvement is expected 
especially by the second (third) EF. 
• Widen the emitter stripes (b Ev > b E o) of those transistors 
which are still at AevM* and adjust Ie v {= ^Ei/.min/ 
6e„). There may be some improvements by the transistors 
in output buffer stages with comparatively large voltage 
swing, in current sources, and sometimes in the lower 
CS*s driven by the clock. 
If the demanded data rate is hot achieved, an iteration loop 
in the design procedure of Fig. 1 is required, possibly starting 
from a modification of the circuit diagram and its resistances 
(e.g., increase of power consumption). However, if the data 
rate is far above the target value, the power consumption and 
thus the transistor sizes can be reduced. 

If the EFs in the circuit are strongly capacitively loaded at 
comparatively low static Jc, the peak collector current %c can 
exceed the static value considerably. Thus, if A E min from (4) 
is calculated by use of 7c» the transistor enters the high-current 
region for a short time which might increase the time jitter at 
maximum data rate [31]. On the other hand, if ic is used 
for calculating ylEmin the emitter may become unfavorably 
long. In this case, a reliable emitter design requires transistor 
models which are sufficiently accurate in the high-current 
region. However, such models are not always available [32]. In 
practice, the problem is mitigated by the facts that (as already 
mentioned) the optimum Ae of the second (third) EF is usually 
larger than Ae min and that the collector current peak of the 
EF's is normally very short. 

Up to now the transistor design was based on the condition 
3C < Jck- As mentioned before, the junction capacitances 
((?CBjCcs) m buffer stages with comparatively high output 
voltage swing dominate the operating speed. They can be 
reduced by choosing Ae < A Em in- In this case, a certain 
increase of the transit time rp by penetrating the high-current 
region may be tolerable [8]. 

For analog circuits, some additional aspects in transistor 
design have to be considered. For example, in the first stage 
of low-noise amplifiers, the choice Ie > Ae min/ b E o may be 
advantageous in order to reduce r B and r E and thus the noise 
figure. Moreover, in linear broadband amplifiers for high data 
rates, not only the magnitude of the gain but also the group 
delay must be approximately constant up to the required cutoff 
frequency fa. Otherwise the time jitter grows unacceptably 
high. A rough condition is given by [21] 

/c/GHz > 0.75 x (data rate)/Gb/s. . (6) 

These conditions must be met by a careful optimization of 
the dimensions and thus the parameters of the transistors (e.g., 
[21] and [10]). 

This discussion on transistor optimization shows that the 
model parameters are required for many quite different-sized 
transistors. If all these parameters have to be determined 
experimentally by parameter extraction methods the measuring 
effort is enormous. Moreover, after even a small change in 
technology, ail transistors in the library have to be fabricated 
and measured again in order to get the new parameter set. 
Therefore, it is advantageous to use a "semiphysicaT transistor 



model. It utilizes the (comparatively simple) relations between 
the model parameters (base resistance, junction capacitances, 
etc.) on one hand and the transistor dimensions as well 
as specific electrical and technological parameters on the 
other hand [33]. Such specific parameters are, e.g., the sheet 
resistances of the internal base, of the polysilicon layers, and 
of the buried layer (in fi/D), the specific contact resistances 
(in 0/jm 2 and f2/un, respectively), the length-specific and 
area-specific values for the periphery and the bottom of the 
junction capacitances (in fF//im and fF//xm 2 , respectively), 
thickness and doping concentration of the epitaxial collector, 
etc. All these specific parameters as well as their temperature 
dependence can be experimentally determined by simple large- 
size test structures, so that the measuring effort and error are 
only small. From these data, the model parameters for arbitrary 
transistor configurations can now be calculated without need- 
ing a voluminous transistor library. Moreover, this is the only 
practical way to predict the influence of process tolerances 
on circuit behavior by simulation. A corresponding computer 
program called TRADICA was developed by our group [33]. 
It is intensively used for circuit design, also by the industry. 

C. Some Specific Design Problems 

The problems discussed in this section are restricted to the 
intrinsic circuit. Further problems concerning the layout and 
the mounting parasitics are discussed in Section V and VI, 
respectively. 

1) Transistor Modeling Aspects: The closer a circuit oper- 
ates to the speed limit of the technology used, the stronger 
are the demands on the accuracy of the transistor models. 
Accurate models which overcome the problems of the fre- 
quently used Gummel/Poon-model in SPICE are available 
for Si-bipolar transistors. 14 Such a model, named HICUM 
(from HIgh-CUrrent Model) has been described in [27]. It 
is semiphysical in the sense discussed in Section IV-B and 
can be used up to high current densities. Its extension for 
advanced transistors and operating frequencies up to fr has 
recently been published [35], [31], [29]. Some improvements 
of this "Advanced HICUM" compared to the original model 
shall briefly be discussed now. 

In very high speed circuits, nonquasi-static effects have to 
be considered more accurately, since the operating frequency 
is no longer low compared to the intrinsic transit frequency 
of the transistor, / T0 = (27TT/)" 1 . This was confirmed by 
numerous simulations. For example, these effects degrade the 
frequency response and stability of amplifiers and support 
potential ringing of cascaded EF's even in digital circuits. 
This nonquasi-static behavior of the internal transistor is taken 
into account in the Advanced HICUM by additional delay 
times (ti,T2) for the emitter diffusion capacitance Cd e and 
the intrinsic transconductance g m (cf. [44]) 

C de = Cdeo exp( - j ljt 2 ) , (7a) 
9m = 9mo expl-ju^Ti + t 2 )]. (7b) 

14 A critical discussion of the different models is given in [32] and [34]. 
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As a consequence, the usual frequency dependency of the 
current gain 0 is modified by multiplication with exp(-jvri). 
Since T\ and r 2 cannot be measured accurately, we determine 
these parameters by device simulations based on the transistor 
doping profile. 15 

As another improvement of the Advanced HICUM, the 
increasing influence of the peripheral active transistor with de- 
creasing emitter width 6^ is considered more accurately, e.g., 
by a reduced effective internal base resistance r Bl . Moreover, 
the dynamic emitter current crowding is accurately modeled 
by shunting a capacitance to rei- 

Nowadays, there are still modeling aspects which have to 
be considered more carefully. As an example, the influence of 
a high-ohmic p-substrate is not considered with sufficient ac- 
curacy in transistor models. For example, at high frequencies, 
the transistor isolation box can no longer be modeled by the 
usual series connection of the junction capacitance Ccs and an 
ohmic series resistance (with the latter usually not known to 
the designer). One reason for this is the dielectric behavior of 
the substrate which can be considered by a capacitance shunted 
to the substrate resistance. Moreover, the spreading resistance 
of the channel stopper region must be considered carefully. 
An improved four-element equivalent circuit for the isolation 
box has been presented in [36] and [45] and confirmed by 
measurements up to 20 GHz. The model parameters are 
calculated from the layout and technological data by three- 
dimensional numerical simulation. 

The influence of inaccurate substrate modeling on transis- 
tor behavior is usually more pronounced in analog circuits 
compared to digital circuits. This also holds for the parasitic 
coupling between the different transistor isolation boxes and 
other circuit components, which may, e.g., degrade the fre- 
quency response and stability of high-gain amplifiers as well 
as the performance of mixed-mode ICs. The development 
of adequate methods for the consideration of the crosstalk 
problems in very-high-speed circuit designs are still in a 
preliminary stage (for references see, e.g., [45]). 

Another imperfection of most transistor models imple- 
mented in circuit simulators is that breakdown phenomena are 
not or not sufficiently considered. This is a severe problem 
especially in the design of high-speed ICs as discussed in the 
following section. 

2) The Breakdown Problem: An improvement of silicon 
bipolar technologies with respect to higher operating speed 
must be paid by a reduction of the breakdown voltages V C eo 
(open base) and V C bo (open emitter). The fastest production 
technologies have, e.g., Vceo « 3.5 to 5 V and V C bo « 12 
to 18 V. In advanced laboratory technologies these voltages 
can be considerably lower (e.g., V C eo ~ 2.5 V). Thus, the 
designer of very-high-speed ICs must learn to handle such low 
breakdown voltages. In several cases, high collector-emitter 
voltages Vce can, e.g., be simply avoided by connecting one or 
more emitter-base diodes in series to the collector. Problems 

,5 It should be mentioned thai the influence of nonquasi -static effects is 
typically increased (compared to the values given in 131] and [35]) in advanced 
high-speed technologies due to the increased (relative) contribution of the 
travelling time through the collector space charge region and, possibly, by a 
drift field in the base generated by a Ge gradient. 
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arise if high output voltage swings are required (e.g., [8]). 
Fortunately, the base of a fast switching transistor in a circuit 
is typically driven by a low-ohmic source (rather than by 
a current source) and, additionally, often an emitter series 
impedance exists. As a consequence, the relevant breakdown 
voltage is higher than Vceo- However, it should be mentioned 
that for Vce > Vceo the base current Ib changes its sign. The 
corresponding voltage drop across the internal base resistance 
r B i results in a pinch-in of the collector current and can lead 
to electrical second breakdown (e.g., [37], [38]). This effect, 
which reduces the maximum admissible Vce compared to 
f*Bi = 0, is the reason why for transistors in common-base 
configuration and in EF!s the practical breakdown voltage 
can be essentially lower than Vcbo- In the case of transistors 
with multiple emitter stripes, it may be advantageous to use 
emitter balance resistors in order to obtain a uniform current 
distribution. 

In conclusion, the usual condition Vce < Vceo is often not 
justified and can unnecessarily restrict the application range 
of a given technology. However, for Vce > Vceo accurate 
consideration of breakdown phenomena in circuit simulators, 
as, e.g., in [39], is highly recommended. 

3) Ringing Caused by Emitter Followers: Ringing is a spe- 
cial problem of EF's and has, therefore, to be considered 
carefully during circuit design. At the circuit input, the neg- 
ative real part of the EF's input impedance can, together 
with the input capacitance, the bond inductance, and the 
driving source impedance, even cause undamped oscillations. 
However, by use of the on-chip transmission-line matching, 
discussed in Section III, this problem can be reliably avoided. 

More critical is the potential "on-chip" ringing or (even 
undamped) oscillation. It can be caused by the low inductive 
output impedance of the driving EF or TIS (possibly sup- 
ported by an inductive behavior of the connection line, cf. 
Section V) and the capacitive input impedance (combined with 
a negative real part) of the loading EF, current switch (CS), 
or TAS. Mostly, the problem increases with the number of 
cascaded EF's and the length of the interconnection lines (cf. 
Section V). However, by a careful individual design of the 
transistors and their operating points, as well as of the on-chip 
wiring, these problems can be overcome, but often at cost of 
the pulse steepness and cutoff frequency, respectively. This 
also hoilds if a resistor is used in series to the base of the 
CS transistors. 

In digital circuits with differential operation, an adequate 
compromise between speed and stability is sometimes possible 
if that EF pair which directly drives the CS has only a 
comparatively small quiescent current. 16 As a consequence, 
during switching, the off-going EF transistor is switched off 
(ic « 0) for a short time, thus damping the ringing on the 
top of the pulse. The speed reduction by this measure is often 
tolerable since the peak current of the ongoing EF transistor is 
mainly determined by the input capacitances of the succeeding 
stage during the transient, rather than by its quiescent current. 

As an experimental example, Fig. 9 shows a section of the 
(single-ended) output pulse sequence of the laser/modulator 

16 Sometimes this measure may also be advantageous in (differential) 
limiting amplifiers. 
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Fig. 9. (a) Ringing of the output voltage of a driver circuit caused by EF's. 
(b) Avoidance of ringing by the measures discussed in the text. Data rate 10 
Gb/s (200 ps/div.). 



driver in Table I [8]. This circuit consists of two cascaded cur- 
rent switches, each driven by three EF stages (cf. Section VII- 
A). In Fig. 9(a), a severe ringing is observed which drastically 
degrades the eye diagram. This ringing, which has already 
been detected by simulation during circuit design, is caused 
by the first current switch and the driving three EF's. By 
reducing the quiescent current of the third EF, as discussed 
before, the ringing is sufficiently damped without essential 
degradation of the pulse steepness as shown in Fig. 9(b). A 
similar measure was, e.g., applied in the two-stage output 
buffer of the decision circuits in Table I [cf. Fig. 2(c)] [6]. 
As an additional advantage of this method, the quiescent EF 
current can still be readjusted after chip mounting by simply 
using an external resistor. 



V. Layout Aspects 

For differentially operating digital circuits and broadband 
amplifiers, a highly symmetrical layout should be aimed 
at. By this measure, the advantage of differential operation 
with respect to drastically reduced electrical and thermal 
interactions (via the p-substrate and the bond wires) as well 
as small time jitter (cf. Fig. 3) can be fully exhausted. In 
addition, a large ground contact (e.g., realized as a wide ground 
ring around the chip) with a sufficient number of bond pads 
is recommended. This measure reduces the critical parasitic 
ground inductance caused by on-chip wiring and bond wires, 
which may result in common mode noise. 

In single-ended amplifiers, the situation can be quite dif- 
ferent. For example, interactions via a common ground bond 
wire must often be reduced by using a separate bond wire for 
the ground of the first amplifier cell. While on-chip decoupling 
of the supply voltage can simplify the off-chip decoupling in 
differential amplifiers (e.g., [10]) it is often not recommended 
for single-ended amplifiers. This is due to the influence of a 
parasitic resonance circuits built by the on-chip . decoupling 
capacitance and parasitic inductances (caused by both bond 
wires and on-chip wiring). But even if no extra on-chip de- 
coupling capacitance is applied, a similar effect can be caused 
by the parasitic capacitance between ground metallization 
and substrate (possibly increased by the collector-substrate 
capacitance of the EF's), especially if a low-ohmic (p + ) 
substrate is used. Thus, minimization of both the parasitic 
inductances and capacitances (tradeoff) is necessary in order 
to shift the corresponding resonance frequency far above the 
operating frequency of the amplifier. As another problem, 



mainly at single-ended operation, potential noise coupling via 
the substrate must be considered. In principle, this coupling 
can be reduced by shielding. However, especially in the case of 
sensitive circuit nodes, we must carefully check whether this 
measure is really helpful or, on the contrary, even degrades 
circuit performance (e.g., due to a noisy shield potential). 17 

Another very important layout aspect is a careful design of 
signal lines on the chip. These lines can influence the circuit 
performance severely, a fact which is often underestimated. 
Since there is, for several reasons, a lower limit for the chip 
size (even for small-scale IC's) the signal path(s) between the 
input(s) and output(s) can usually not be chosen as short as 
desired. Maximum line lengths of several 100 pm are often 
unavoidable. For these reasons, accurate line modeling is a 
must. Therefore, we prefer to realize longer lines as microstrip 
lines with the signal stripe in one of the upper metallization 
levels and the ground plane in the lowest level. Such a line 
is fairly well defined (depending on the "quality" of the 
ground plane) and can be modeled accurately also by lumped 
elements. In contrast, modeling of lines running over the sub- 
strate is very complicated since the model parameters do not 
only depend on the inhomogeneous nature of the underlying 
substrate but also on neighboring regions (channel stopper, 
transistor boxes, substrate-contact and other metallized areas, 
etc.) and, moreover, on the line length. 

With the same argument of prime priority for accurate 
circuit simulation, we often use the buried layer as a grounded 
plane underneath signal pads (but, see footnote 17). As another 
advantage, the only small resistance (caused by the buried 
layer) in series to the pad capacitance may simplify the match- 
ing of the driving transmission line, discussed in Section III-C 
[Fig. 6(a)]. 

Optimum partitioning of a high-speed circuit may depend 
on thermal and crosstalk considerations. Another (often the 
dominating) partitioning aspect is to divide the total signal- 
line length between input and output pads into optimum 
sections. For this, we must know which lines can have a strong 
(normally negative) influence on circuit behavior ("critical" 
lines) and must, therefore, be considered carefully, and which 
lines have only a small influence ("uncritical" lines). As a. 
rough rule of thumb, normally the critical lines should be 
shortened at cost of the uncritical ones. As an example, the 
different line segments in the signal path of a digital circuit 
and of a broadband amplifier with respect to their influence 
on circuit behavior are characterized in Fig. 10. Uncritical are 
lines driven by a CS 18 or a TAS due to their high output 
impedance. In contrast, critical are lines driven by EF's or a 
TIS due to their low (inductive) output impedance and to the 
capacitive input impedance (usually combined with a negative 
real part) of a loading EF, CS, or TAS. The influence of 
these lines is most pronounced in low-ohmic (e.g., driver) 
circuits where the EF's are designed for a very low output 
impedance (low r B and high I c ). In this case, even small 

17 For example, shielding of the input bond pad by an underlying buried 
layer connected to the on-chip ground can result in severe coupling problems 
if the ground is noisy. In this case we must renounce this shield or connect it 
to another, less critical circuit node. 

18 As long as they are not part of a feedback path [as, e.g., in Fig. 2(a)). 
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Fig. 10. Characterization of different signal-line segments in high-speed 
ICs, c = critical, lc = less critical, uc = uncritical. The input resistor i?/ 
and especially the load resistor of the CS, Rc. are preferably located at the 
line end. 



line inductances can result in severe ringing and even in 
undamped oscillations. 19 In addition to shortening the lines, it 
is sometimes helpful to use microstrip lines with comparatively 
wide stripes, possibly combined with a thinner dielectric layer 
(use of neighbored metal layers for the signal line and the 
ground plane), in order to reduce the effective line inductance 
(of course at cost of an increased line capacitance). 

In Fig. 10 the input and output lines are characterized as 
"less critical," ranking between the critical and uncritical lines. 
Double arrows indicate that the on-chip output resistor Rq can 
be alternatively located at both ends of the line, resulting in a 
somewhat different line influence on circuit behavior. In [10] 
and [111 a single EF is used at the amplifier input in order to 
reduce the equivalent input noise voltage density. 

There are a lot of other examples where on-chip wiring can 
lead to severe performance degradation, especially in low- 
ohmic circuits. However, their discussion would be beyond 
the scope of this paper. 

VI. Mounting Aspects 

First, it has to be pointed out that on-wafer measurements, 
which are often presented in publications, are useful for pres- 
election of the chips but not at all for the final characterization 
of a circuit. The reason for this is that in practice only mounted 
chips can be used and that, therefore, the mounting parasitics 
have to be considered already in the circuit design (cf. Fig. 1). 

For measurements, our chips are directly glued into a 
measuring socket and are conventionally ultrasonic bonded 
using 1-mil aluminum wires. The socket consists of a brass 
block, a dielectric substrate plate with 50-O microstrip lines, 
and SMA or K connectors. The substrate, which is soldered 
on the brass block, is either a ceramic plate with gold clad 
or a Teflon plate (PTFE) with copper clad on both sides. 
If possible, we prefer the latter substrate because it is less 
expensive and can easily be treated [5], [18]. Vias to the 
ground plane can be made easily by use of copper ribbons 

19 For example, in the case of the laser/modulator driver of Table I, line 
inductances as low as 20 pH behind each of the three EFs driving the output 
current switch [cf. Fig. 12(a)! would result in undamped oscillations. 
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Fig. 11. Measuring socket: (a) schematic section and (b) photograph of a 
mounted DEMUX chip with ground vias in each comer (chip size: 0.74 mm 
x 0.74 mm). 



soldered on both substrate sides. A section of the measuring 
socket and a practical example of a mounted chip (32 Gb/s 
DEMUX [6]) is shown in Fig. 11. 

The parasitic bond inductances (as far as they are undesired) 
are reduced by the following measures (cf. Fig. 11). 

• By milling a recess for the chip into the brass block the 
surfaces of the chip and the microstrip lines are at the 
same level, resulting in short bond wires. 

• In each corner of the chip there are ground pads which 
are all connected by short bond wires to adjacent pads 
(vias) on the substrate. If the ground pads on the chip 
must be more concentrated for some reasons, 20 multiple 
bonding pads should be used, providing a sufficient angle 
between the directions of the bond wires. 

• For differential operation, the effective (odd) bond in- 
ductance at the (complimentary) signal pads can be es- 
sentially reduced by use of parallel bond wires with 
small distance between them. This is because the mutual 
inductance increases. Formulas for the calculation of this 



20 For example, in the case of single-ended amplifiers, short bond wires are 
required not only to ground but also to external decoupling capacitors. These 
capacitors have to be arranged close to the chip so that ground bonding to all 
four comers of the chip [as in Fig. 1 l(b)]may no longer be possible. 
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effect are, e.g., given in [41]. They proved to be well 

suited for practical applications [18]. 
As a problem of the low-permittivity substrate, 50-S7 lines have 
comparatively wide stripes (750 ^m). Therefore, near the chip, 
narrower stripes must be used. However, for complimentary 
lines the (differential) characteristic impedance of 50 Q can 
be maintained by reducing the distance between both lines 
[cf. Fig. 11(b)]. 

The suitability of this mounting technique up to very high 
data rates was confirmed by numerous examples. For example, 
the 50 Gb/s time-division multiplexer, presented in the next 
section, was mounted in the same way as shown in Fig. 11 
with a single exception: the semirigid cables at the (50 Gb/s) 
output were directly soldered with the microstrip lines of 
the substrate, thus eliminating the connectors between them. 
This simple connection shows an excellent high-frequency 
performance. 



vii. Some Circuit examples and Measuring Results 

Some high-speed circuits designed by applying the tech- 
niques discussed in this paper have already been presented 
in Table I of the Introduction. In this section, three rather 
different kinds of circuits are selected, a 14 Gb/s driver circuit, 
a 13 Gb/s AGC broadband amplifier, and a 50 Gb/s MUX. For 
a fair comparison of the speed performance of these IC's, it has 
to be considered that the first two were fabricated in (different) 
production technologies, while for the third one, an advanced 
laboratory technology was used. 



A. Driver Circuit 

Fig. 12(a) shows the (simplified) diagram of a driver circuit 
developed for the transmitter of a 10 Gb/s optical-fiber link 
[8]. It was fabricated with the Siemens production technology 
B6HF [13]. Its single-ended output is able to drive both a laser 
diode directly or an external modulator. While in the first case 
the output current swing has to be varied within a wide range 
(here: 1 5 m A < AIq < 60 m A) a comparatively high voltage 
swing (here: AVq = 3.6 V p - P ) is required in the second case. 
The circuit consist of two current switches, which are (for the 
reasons discussed in Section III-A) each driven by three EF 
pairs. The basic problems of such a circuit and their solutions 
discussed in [8] shall briefly be mentioned. 
— Asymmetrical pulse shape due to the single-ended output. 
As a consequence, the eye diagrams can be severely 
degraded. By introduction of two offset voltages [caused 
by the currents /ofi and JoF2 in Fig- 12(a)] symmetrical 
output pulses can be generated. 
— Large output time constant due to the high voltage swing 
(tq <x AVq). Here, the only practical way to achieve the 
required quality of the output eye diagrams at 10 Gb/s and 
above was to steepen the pulse edges by a peaking bond 
inductor Lp («2 nH) at the output (cf. Section III-C). 
— Wide output current range for direct laser modulation. 
For this, in addition to the current source of the output 
stage, the currents through several other stages have to be 
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Fig. 12. Laser/modulator driver, (a) Circuit diagram, (b) Output eye diagram 
at 14 Gb/s (29 ps/div.). 



optimally varied via a single control input CON in order 
to get clear eye diagrams. 
— Exceeding the breakdown voltage Vceo(«3.7 V). This is 
a consequence of the high output voltage swing. Hie 
resulting maximum Vqe of 5.5 V can, however, be 
tolerated in the present case (cf. Section IV-C). 
The measured (single-ended) output eye diagram at 14 Gb/s 
is shown in Fig. 12(b). In this case, the current switched by 
the output transistors is as high as 108 mA, resulting in an 
output voltage swing of AVq = 3.6 V p - P (across an external 
load of R L = 50 U). 



B. AGC Broadband Amplifier 

Based on the amplifier cells discussed in Section III-B 
(Fig. 5) a main amplifier with automatic gain control (AGC) 
was developed for a 10 Gb/s optical-fiber link [10] and 
fabricated with an advanced version of Motorola* s production 
technology MOSAIC V [42]. The block diagram of the dc 
coupled and fully differential circuit is given in Fig. 13(a). 
The h.f. part consists of an high-ohmic input buffer IB, two 
controllable amplifier cells Al (cf. Fig. 5) and A2 (modified 
compared to Al [10]), a constant-gain cell A3, and two 
separated 50-f2 output buffers. The driving transmission line is 
well matched by a 50-Q input termination circuit IT which is 
combined with an offset-control circuit OC. For gain control, 
the output signal of A3 is detected by a peak detector D 
and compared with the nominal voltage (REF). The voltage 
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Fig. 13. Broadband AGC amplifier with two outputs, (a) Block diagram, (b) 
Gain versus frequency characteristics for a wide range of gain. 



difference Vqc is amplified by a high-gain dc amplifier which 
generates the two different control voltages for Al and A2. 

Aspects for the design of this amplifier have already been 
discussed in [10]. Besides the high data rate (>10 Gb/s) and 
the high gain (52 1 = 37 dB) also the demands on high 
input dynamic range (41 dB) and sufficiently low equivalent 
noise voltage density at the input (2.5 nV/y/Hz) had to be 
met. Fig. 13(b) shows the measured frequency response of 
the gain (|S 2 i|) with the low-frequency gain as a parameter. 21 
Despite the extremely wide gain range (60 dB!) the 3-dB cutoff 
frequency fc is nearly constant («10 GHz). Moreover, clear 
eye diagrams were measured up to 13 Gb/s at a constant output 
voltage swing of 500 mV p - p within the total input dynamic 
range [10]. 

C. Time-Division Multiplexer 

The circuit diagram of the 2:1 MUX, well proven in 
previous designs [5], is shown in Fig. 14(a) [4]. No separate 
output buffer is used as it would decrease the operating speed 
and increase the power consumption. In another version, a 
clock input stage with 50-Q on-chip matching resistors is used 
[cf. Fig. 2(b)], [6]. 

21 In this case the automatic-gain-control had been switched-off and Vqc 
was varied via an external control voltage rather than by the output of the 
peak detector. 
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Fig. 14. 2:1 MUX. (a) Circuit diagram, (b) Output eye diagrams at 40 and 
50 Gb/s (10 ps/div.). 

After fabrication in a laboratory technology [14], the chip 
was mounted on the measuring socket presented in Section VI. 
The 25 Gb/s input signals for driving the circuit were generated 
by multiplexing four pseudorandom pulse sequences with a 
word length of 2 15 - 1 b each [43]. The output eye diagrams 
measured at 40 and 50 Gb/s are shown in Fig. 14(b) with 
(differential) voltage swings of AVq = 1 V p - p and 0.6 V p _ p , 
respectively [4]. To the best of the authors' knowledge, 50 
Gb/s is the highest data rate ever achieved with an integrated 
circuit in any technology. 



VIII. Conclusion 

Design aspects for digital and analog Si-bipolar ICs operat- 
ing well above 10 Gb/s were presented and their usability was 
confirmed by measurements of practical ICs. Data rates up 
to 50 and 46 Gb/s were achieved with a MUX and DEMUX, 
respectively. However, we cannot conclude from these results 
that present Si technologies and design methods are already 
sufficient for all the other circuits (especially for the amplifiers 
and laser/modulator driver) in 40 Gb/s optical communication 
systems just under development [22]. To reach this goal, a 
further improvement not only of the technology but also of 
the mounting technique and of the models for transistor and 
circuit parasitics are required. In addition, the specification of 
the most critical circuits in 40 Gb/s systems must be relaxed 
compared to those given in Table I for 10 Gb/s systems. 
However, from their present designs, the authors expect that 
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the circuits principles and design aspects presented in this 
paper remain suitable also for such a high operating speed. 
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